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Summary 


This  report  evaluates  the  ability  of  16-bit  microcomputers  to  implement  a subset 
of  the  programming  language  Concurrent  Pascal.  The  first  part  of  the  report  de- 
scribes the  most  frequent  virtual  instructions  and  their  implementation:  the 
second  part  evaluates  the  process  switching  time  of  various  microprocessors. 
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This  report  defines  a precise  method  for  comparing  microcomputer 
instruction  sets.  Each  microcomputer  is  judged  by  its  ability  to 
interpret  the  virtual  code  generated  by  the  Concurrent  Pascal  compiler. 
The  standard  of  comparison  is  the  Concurrent  Pascal  interpreter  for  the 
PDP  11/45  minicomputer.  This  report  identifies  the  most  frequent 
virtual  instructions  and  their  implementation  on  the  PDP  11/45  computer. 
It  then  evaluates  the  speed  and  size  of  interpreters  implemented  on 
various  16-bit  microcomputers. 
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1.  ItiTRODUCTIQH 


A new  research  project  at  USC  seeks  to  develop  a programming 
methodology  and  machine  architecture  for  concurrent  programming  on 
microprocessor  networks  with  distributed  storage.  The  starting  point  of 
this  project  is  the  programming  language  Concurrent  Pascal  implemented 
on  the  PDP  11/45  minicomputer  [Brinch  Hansen,  1977a]. 

The  initial  aim  of  the  project  is  to  evaluate  existing 
microcomputers  and  select  one  of  them  for  a uniprocessor  system  with  a 
display  terminal  and  a floppy  disk.  This  microcomputer  system  will  be 
able  to  execute  concurrent  programs  written  in  a subset  of  Concurrent 
Pascal  [Brinch  Hansen,  1977h]. 

The  Concurrent  Pascal  subset  includes 
processes,  monitors,  classes 
routines,  statements,  expressions 
enumerations,  arrays,  records,  queues 
terminal  and  disk  input/output 

It  does  not  include  reals  and  sets  and  cannot  execute  sequential  Pascal 
programs . 

A concurrent  program  will  be  compiled  on  a PDP  11/45  minicomputer 
and  transferred  to  a microcomputer  via  a floppy  disk.  The  compiled  code 
consists  of  virtual  instructions  that  will  be  executed  by  a machine 
program  called  the  interpreter.  Such  an  interpreter  already  exists  in 
the  PDP  11/45. 
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Our  initial  goal  is  to  compare  the  instruction  sets,  input/output 
facilities,  and  prices  of  existing  microcomputer  systems.  This  report 
concentrates  on  the  first  of  these  aspects:  the  efficiency  of  the 
instruction  sets. 

Our  primary  goal  is  to  use  a microprocessor  to  implement  an 
abstract  language  for  concurrent  programming.  Although  the  final 
language  chosen  may  differ  from  Concurrent  Pascal  in  details  it  is 
likely  to  have  many  things  in  common  with  it  (among  them  processes, 
procedures,  arrays,  records,  and  enumeration  types).  The  code  generated 
should  therefore  be  quite  similar  to  that  of  Concurrent  Pascal. 

It  is  therefore  reasonable  to  use  the  Concurrent  Pascal  interpreter 
for  the  PDP  11/45  as  a precise  standard  of  comparison  for  the 
instruction  sets.  This  report  identifies  the  most  frequent  virtual 
Instructions  and  the  machine  code  that  interprets  them  on  the  PDP  11/45 
computer . 

The  report  also  defines  the  machine  code  needed  to  interpret  the 
same  virtual  instructions  on  various  microcomputers.  This  is  used  to 
estimate  the  absolute  size  and  speeds  of  Concurrent  Pascal  interpreters 
for  these  microcomputers. 


Since  our  aim  is  to  develop  new  microcomputer  technology  that  will 
simplify  language  implementation  for  non-trivial  applications  we  will 
ignore  8-bit  microprocessors  and  concentrate  on  16-bit  processors. 
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2.  VlIRTUAL  MACHINE 

Since  the  virtual  machine  spends  most  of  its  time  executing 
sequential  statements  within  a process  we  will  only  describe  the 
sequential  aspects  of  the  virtual  machine  here.  The  virtual  machine  for 
Concurrent  Pascal  is  described  in  more  detail  elsewhere  [Brinch  Hansen, 
1977a], 

In  a microcomputer  network  each  processor  should  perform  a 
reasonably  simple  task  that  does  not  require  too  much  storage.  We  will 
therefore  assume  that  a virtual  instruction  potentially  can  address  the 
entire  store  of  a single  processor.  Each  processor  can  access  its  own 
store  only. 


In  a uniprocessor  system  the  store  is  divided  into  segments  of 
fixed  length 


interpreter 

virtual 

code 

process 

segment 

process 

segment 

The  lengths  of  the  virtual  code  and  the  process  segments  are  determined 
by  the  compiler.  This  is  possible  because  a Concurrent  Pascal  program 
consists  of  a fixed  number  of  processes  without  recursive  procedures. 


Each  process  segment  is  organized  as  a stack  that  grows  from  high 
towards  low  addresses.  The  storage  of  procedure  parameters  and 
variables  in  the  stack  is  shown  below. 
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temporaries 
variables 
dynamic  link 
parameters 

The  store  consists  of  16-bit  words  divided  into  2 bvtes  each.  Words  are 
addressed  by  even  byte  indices.  An  enumeration  type  (or  an  address)  is 
represented  by  a single  word  in  the  stack.  Arrays  and  records  are 
represented  by  one  or  more  words.  A text  string  consists  of  an  even 
number  of  bytes. 

A dynamic  link  of  5 words  defines  the  state  of  the  process  prior  to 
a procedure  call.  The  parameters  and  variables  are  addressed  by 
positive  and  negative  even  displacements  relative  to  the  dynamic  link. 
Temporary  addresses  computed  in  the  stack  are  absolute  within  the  whole 
store . 

A concurrent  program  can  only  access  variables  that  are  either 
local  to  a procedure  or  global  to  a class,  monitor,  or  process. 
Procedures  cannot  be  nested. 

A concurrent  program  is  compiled  into  a single  segment  of  virtual 
code.  Each  virtual  instruction  consists  of  an  operation  code  followed 
by  zero  or  more  arguments.  The  operation  codes  and  their  arguments  each 
occupy  one  word.  The  virtual  code  refers  to  program  labels  and  stack 
variables  by  relative  addresses  only. 

The  Solo  operating  system  for  the  PDP  11/45  computer  is  a typical 
of  a non-trivial  Concurrent  Pascal  program  [Brinch  Hansen, 


example 
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1977a].  A dynamic  analysis  of  the  virtual  code  executed  by  the  Solo 
system  shows  that  9 kinds  of  virtual  instructions  account  for  71  per 
cent  of  all  instructions  executed. 


virtual 

instruction 

dynamic  execution 

frequency  time 

(.%)  (usee) 

weighted 
time 
( usee ) 

code 

piece 

(words) 

pushvar iable 

26 

9 
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index 

7 
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1.3 

12 

pushaddress 
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8 

1.0 
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pushconstant 

9 

6 

0.5 

2 

equalword 

5 

10 

0.5 

6 

copyword 

5 

7 

0.4 

2 

copyby te 

4 

7 

0.3 

2 

enter 

1 

22 

0.2 

10 

exit 

1 

15 

0.2 

8 

71 

6.7 

49 

The  execution 

time  of 

a particular  virtual 

instruction  weighted 

by  its 

frequency  of 

execution  defines  the  contribution  of  that  instruction  to 

the  average  instruction  time.  Since  71  per 

cent  of  all 

instructions 

contribute  6. 

7 usee 

the  averatte  instruction  time  on  the 

POP 

11/45  is 

6.7/0.71  = 

usee , 

The  total 

length 

of  the  selected  code 

pieces  is  49 

words.  The 

whole  interpreter  is  about  1 000  words  long  on  the  PDF  11/45. 


We  will  choose  these  virtual  instructions  as  being  representative 
of  the  50  virtual  instructions  or  so  needed  to  implement  the  Concurrent 
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Pascal  subset.  This  will  be  our  basis  for  evaluating  the  speed  and  size 
of  interpreters  on  other  machines. 

3.  £DP  11/45  INTERPRETER 

In  implementing  the  Concurrent  Pascal  subset,  we  will  ignore  the 
address  mapping  of  the  POP  11/45  computer  and  view  it  as  a 16-bit 
machine  that  can  address  a store  of  64  K bytes  (or  32  K words).  Words 
are  addressed  by  even  (byte)  addresses  [Digital,  1975]. 

The  machine  has  8 word  registers  that  are  used  as  follows  by  the 
interpreter 

w,  X,  y scratch  registers  used  during  the  interpretation  of  a single 
virtual  instruction, 
s the  address  of  the  stack  top. 

b,  g the  base  address  of  the  current  local  and  global  variables, 
q the  virtual  program  counter, 

p the  real  program  counter. 

The  virtual  program  counter  q points  to  the  next  virtual  instruction  (or 
one  of  its  arguments).  The  real  program  counter  p points  to  the  machine 
code  that  interprets  the  virtual  instruction. 


The  interpreter  code  will  be  defined  in  Pascal-like  notation  so 
that  each  line  corresponds  to  one  machine  instruction.  The  store  and 
the  registers  are  declared  as  follows: 

var  byte:  array  [integer]  of  integer; 
w.  X,  y,  b,  g,  q,  s,  p:  integer; 


Among  other  things  the  store  contains  the  vir^jAfiJ.  instructions . 
Consider,  for  example,  the  virtual  instructioi.  that  pushes  an 
enumeration  constant  on  the  stack.  This  instruction  consists  of  two 
words  representing  the  operation  pushconstant  and  a constant  value . 
When  this  instruction  is  executed  the  virtual  instruct’ on  counter  q 
points  at  the  constant  value: 

q 


Since  the  stack  grows  towards  low  addresses  the  interpreter  first 
decrements  the  stack  top  s by  one  word  (2  bytes)  and  copies  the  constant 
value  from  the  virtual  code  to  the  stack.  It  then  increments  the 
virtual  program  counter  q by  one  word  (2  bytes)  to  make  it  point  at  the 
next  operation  code.  All  this  is  done  by  a single  machine  instruction 
which  can  be  defined  as  follows 

s 2;  word(s)  :r  word(q);  q :+  2; 

(An  assignment  such  as  s 2 is  a short-hand  for  s :=  s - 2) 

The  interpreter  contains  a code  piece  for  each  virtual  instruction. 
An  operation  table  defines  the  starting  addresses  of  the  code  pieces. 
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operation  table 
code  piece 


code  piece  | 

The  operation  codes  are  used  as  indices  in  the  operation  table  to  locate 
the  corresponding  code  pieces.  When  one  code  piece  has  been  executed 
the  next  piece  is  located  as  follows: 


virtual  code 


operation  table 


code  piece 


The  virtual  program  counter  q points  to  the  next  operation  code  which  in 
turn  is  the  absolute  address  of  an  entry  in  the  operation  table.  The 
selected  entry  in  the  operation  table  contains  the  address  of  the 
corresponding  code  piece  that  will  perform  the  operation. 


So  the  next  code  piece  is  located  by  a doubly-indirect  jump 
instruction 


P : = 

word ( word ( q ) ) ; q 

:+  2 

» 

Each  code  piece  ends 

with  such  a jump 

to 

the 

next  code  piece. 

This 

single  instruction 

is  the  only  overhead 

of 

code  interpretation  on 

the 

PDF  11/45  computer. 

This  efficient 

form 

of 

code  interpretation 

is 

called  threaded  code 

[Bell,  1973]. 

The  purpose  of 

the  operation  tabl 

e is 

to 

enable  the  compiler 

to 

generate  fixed  operation  codes  that  are  independent  of  any  modifications 

table  must  start 


to  the  code  pieces.  The  only  requirement  is  that  the 
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at  a fixed  absolute  address. 

The  general  notation  used  to  define  a code  piece  is  illustrated 
below ; 

pushconstant ( value ) : 
s 2;  word(s)  :=  word(q);  q :+  2; 
next ; 

It  defines  the  name  and  arguments  of  the  virtual  instruction  and  the 
code  that  interprets  it.  Each  line  corresponds  to  one  machine 
instruction.  The  instruction  next  is  the  indirect  jump  defined  earlier. 

The  rest  of  the  selected  code  pieces  are  defined  below. 

The  virtual  instruction  pushaddress  adds  the  displacement  of  a 
variable  to  the  local  base  address  b and  pushes  the  resulting  absolute 
address  on  the  stack : 

pushaddress(displ) : 
s : - 2 ; word ( s ) : = b ; 
word(s)  :+  word(q);  q :+  2; 
next ; 

When  this  code  piece  is  executed  the  virtual  instruction  counter  q 
points  to  its  argument  (the  displacement).  This  is  a general  rule 
because  the  jump  to  the  next  code  piece  increments  q as  soon  as  it  has 
used  the  operation  code  as  a table  index. 

The  instruction  pushvariable  computes  the  absolute  address  of  an 
enumeration  variable  and  pushes  the  value  of  the  variable  on  the  stack: 


ji 


r 


r 


T • I 


j 


I 

E 


I 
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pushvariable(displ ) : 
w : = b ; 

w ;+  word(q) ; q :+  2; 
s 2;  word(s)  :=  word(w); 
next ; 

The  instruction  coDvword  assigns  the  value  stored  in  the  top  of "the 
stack  to  the  enumeration  variable  whose  address  _i.s,  .s t or'e ci  below  the  top 
of  the  stack.  The  address  and  the  value  of  the  variable  are  popped  from 
the  stack: 

copyword : 

word(word(s  + 2))  :=  word(s);  s :+  4; 
next ; 

The  instruction  coDvbvte  assigns  the  rightmost  byte  stored  in  tne 
top  of  the  stack  to  the  byte  variable  whose  address  is  stored  below  tne 
top  of  the  stack.  The  address  and  the  value  of  tne  variable  are  then 
popped  from  the  stack: 

copybyte : 

byte(word(s  + 2))  :=  byte(s);  s :+  4; 
next ; 

This  instruction  is  included  because  the  processing  of  text  strings  is  a 
fairly  frequent  case  that  is  awKward  to  handle  on  some  16-bit 
microprocessors . 

The  instruction  eoualword  compares  two  words  in  the  top  of  the 


stack  and  replaces  them  with  a boolean  value  ( 1 or  0)  defining  wnether 
or  not  they  are  equal: 
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equalword ; 
w : = 0 ; 

wordCs)  compare  word(s  + 2);  s :+  2; 
if  equal  then 
w : + 1 ; 
word ( s ) : = w ; 
next 

The  comparison  instruction  sets  a condition  register  which  can  be  tested 
only  (but  not  stored  directly).  The  testing  is  done  by  a skip 
instruction  followed  by  an  increment  instruction.  This  is  described 
here  by  an  i£  statement  of  two  lines. 

The  index  instruction  assumes  that  the  stack  already  contains  the 
absolute  address  of  an  array  variable  and  uhe  value  of  an  index  into  the 
array.  The  instruction  checks  that  the  index  i is  within  a fixed  range 
(min  < i < max)  and  replaces  the  array  address  @A  and  the  index  i by  the 
address  of  the  array  element  A(i)  computed  as  follows: 

§A(i)  = §A  + (i  - min)  • length  (where  length  is  the  number  of  bytes 
per  array  element): 
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indexCmin,  max-rain,  length): 
X : = word( s) ; s : + 2 ; 

X : - word( q ) ; q : + 2 ; 


if  less  then 


goto  rangeerror; 

X compare  word(q);  q :+  2; 
if  greater  then 
goto  rangeerror; 

X :*  word(q) ; q :+  2; 
word(s)  :+  x; 
next ; 


Enter  is  the  first  instruction  executed  within  a procedure.  It 
assumes  that  the  stack  already  contains  the  parameters  and  return 
aldress  (old  q)  of  the  call 

s — ► old  q 

parameters 


The  instruction  allocates  stack  space  for  the  dynamic  link  and  the 
variables  of  the  procedure  call 


■j,  .-V  ' 1- 


MICROCOMPUTER  EVALUATION:  INSTRUCTION  SETS 
PDF  n/45  INTERPRETER 


variables 


line  no 


old  s 


old  b 


old  g 
old  q 

parameters 


T 

varlength 

1 


poplength 


enter(notu3ed , poplength,  lineno,  varlength); 
q ;+  2; 

s 2;  word(s)  :=  g;  ^ 

s : - 2 ; word ( s ) : = b ; 

s 2; 

word(s)  :=  s; 

word(s)  :+  word(q);  q ;+  2; 
s 2;  word(s)  :=  word(q);  q :+  2; 


s : - word ( q ) ; q : + 2 ; 
next; 

The  unused  argument  is  irrelevant  in  the  Concurrent  Pascal  subset. 

Exit  is  the  last  instruction  executed  within  a procedure.  It 
removes  the  local  variables,  the  dynamic  link,  and  the  parameters  of  the 
procedure  call  from  the  stack  and  reestablishes  the  previous  values  of 
the  registers  from  the  dynamic  link; 
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exit ; 
s ; = b ; 
s : + 2 ; 

w : = word ( s ) ; s : + 2 
b : = word(s ) ; s : + 2 
g : = word( s ) : s : + 2 
q : = word ( s ) ; s : + 2 

3 : = w ; 

next ; 


IMPLEMENTATION 


Since  Concurrent  Pascal  originally  was  implemented  on  the  PDP  11/45 
the  virtual  code  is  somewhat  biased  by  the  general  features  of  that 
machine  (although  it  does  not  reflect  any  of  its  details). 


In  evaluating  other  machines  it  therefore  makes  sense  to  give  the 
implementor  some  freedom  to  modify  the  virtual  code  slightly  to  use 
these  machines  efficiently.  There  are,  however,  very  good  reasons  for 
limiting  tnis  freedom.  Since  the  long-term  goal  of  this  project  is  to 
invent  simpler  computer  architectures  an  obsession  with  the 
peculiarities  of  existing  machines  would  only  distract  us  from  that 
goal.  Furthermore,  it  is  essential  to  use  the  existing  Concurrent 
Pascal  compiler  witn  as  few  changes  as  possible  to  limit  the  initial 
effort . 
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With  these  conflicting  goals  in  mind  we  have  established  the 
following  constraint's  on  the  implementation  of  interpreters  on 
microcomputers: 

The  following  will  not  change  from  one  machine  to  another 
The  word  length  (16  bits) 

The  format  and  size  of  the  virtual  instructions 
The  format  and  size  of  variables 
The  structure  and  direction  of  the  stack 
The  following  may  be  changed : 

The  operation  codes  (as  long  as  they  are  fixed) 

The  encoding  of  byte  addresses  in  the  virtual  code  and  the  stack 
The  table,  registers,  and  machine  code  used  to  implement  the 
interpreter . 

The  changes  must  only  require  modifications  to  the  last  pass  of  the 
compiler. 


( 

! 
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5.  QA  16/110  MICROCOMPUTER 


5.1  Machine  Properties 

The  General  Automation  16/110  and  16/220  computers  are  single  board 


LSI  microcomputers.  The  machines 
together.  There  are  7 registers:  3 
a base  register.  There  are  two  sepa 
set  is  accessible  at  a time,  so  th 
absolute  addressing  mode  the  first 
Indirect  addressing  and  indexing  may 
mode  an  offset  in  the  range  0 to  31 
base  register  to  determine  the  addre 
indirect  and  indexed  addressing, 
counter  relative  addressing,  and 
Addresses  are  usually  word  addresse 
length,  depending  on  a bit  in 
instructions  use  byte  addresses,  in 
bit  is  used  to  select  a byte  and  the 
The  byte  addressing  within  a word  is 


are  similar  and  will  be  treated 
accumulators,  3 index  registers,  and 
rate  sets  of  registers,  but  only  one 
is  feature  will  be  ignored.  In 
32  words  of  memory  can  be  accessed, 
be  used  with  this  mode.  In  based 
may  be  added  to  the  contents  of  the 
ss.  This  too  may  be  used  with 
Some  instructions  allow  program 
others  allow  immediate  operands, 
s.  They  are  either  15  or  16  bits  in 
the  machine  state.  Two  machine 
which  the  least  significant  address 
rest  is  used  as  a word  address, 
the  opposite  of  the  POP  11s. 


5.2  Interpreter  Implementation 


All  displacements 
stored  in  the  stack 
address  of  a character 


and  increments  will  be  in  words.  Addresses 
will  generally  be  word  addresses,  except  that  the 
within  a string  is  a byte  address. 
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The  Concurrent  Pascal  machine  registers  will  be  allocated  as 
f ollows : 

X,  y accumulators 

s,  w index  registers 

b,  g low  memory  (locations  0 to  31) 

q base  register 


The  transfer  operation  next  uses  the  technique  of  treaded  code. 
Each  virtual  instruction  contains  an  absolute  address  in  the  operation 
table.  The  next  operation  is 
X : =word(word(q) ) ; 
q:+1-} 
p:  =x; 

This  sequence  takes  8.2  usee. 


The  following  describes  the  code  pieces  for  the  nine  most  frequent 
virtual  instructions. 


pushconstant( value) : 

X : =word( q ) ; 
q : + 1 ; 
s:-1 ; 

word(s) : =x; 
next ; 


pushaddress( displ ) : 

X : =word ( q ) ; 
q:  + 1 ; 
y :=b; 

X ; +y  ; 
s:-1 ; 

word( s) : =x ; 
next ; 
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pushvariableldispl)  : 

X : =word ( q ) ; 
q:  + 1 ; 
w : =b ; 
w ; +x ; 

X : =word ( w ) ; 

3 : - 1 ; 

word( s ) : =x ; 

next ; 


copyword : 

X : =word ( s) ; 

3 : +1  ; 

w; =word(s) ; 

3 : +1  ; 

word(w) : =x ; 
next ; 


The  exciu3ive  or  instruction  in  the  following  code  piece  changes  a 

11  byte  index  to  a GA  16/110  byte  index. 

copybyte : 

X : =word( 3 ) ; 

3 ; + 1 ; 

w : =word ( s ) ; 

3 ; + 1 ; 

w : xor  1 ; 

byte(w) ; =right( x) ; 
next ; 


The  if  statement  in  the  following  code  piece  is  a conditional 

instruction.  The  compare  instruction  sets  indicators  for  the  test. 

"equalword ; 

w : =0 ; 

x: =word(s) ; 
s : +1  ; 

X compare  wordCs); 

% if  equal  then 
w : + 1 ; 

word (s ) : =w ; 
next ; 


POP 


jump 
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In  the  next  code  piece,  the  length  parameter  is  the  length  in  words 
except  for  character  arrays.  In  this  case  it  is  zero  and  a byte  address 
is  computed. 


index ( min , max-rain , length ) : 
y : =word ( s ) ; 

3 : + 1 ; 

X : =word(q) ; 
q ; + 1 ; 
y :-x; 

iX  X < 0 then  goto  err; 
y compare  word(q); 
if  greater  then  goto  err; 
q ; +1  ; 

w: =word(s)  ; 

X : =word( q ) ; 
q : + 1 ; 

iX  X = 0 then  goto  A ; 
y :*x; 
goto  B ; 

A:  w:»2; 

B : y : +w  ; 

word(s) : =y ; 
next ; 

err:  goto  rangeerror; 


I 1 


i 


enter(notused,popIength,Iineno,varIength): 

q:  + 1 ; 

3:-1  ; 

x:=g; 

word ( s ) : =x ; 
s:-1  ; 
x:=b; 

word( s ) : =x ; 

s:  -1  ; 

X : =word ( q ) ; 


A . t-O  y 

word ( s ) : =x ; 

3:-1 ; 

X : =word ( q ) ; 

q : + 1 ; 

word ( s ) : =x ; 

b : =3 ; 

X : =word( q ) ; 

q : + 1 ; 

3 : -X ; 

next ; 


( 

t 


r 
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exit : 

w : =b ; 

s : =word ( w+ 1 ) ; 
X : =word ( w+2 ) ; 
b : =x ; 

X : =word ( w+3 ) ; 
g:=x; 

q : =word ( w+4 ) ; 
next ; 


5 . 3 Performance 


The  execution  times  and  space  requirements  are  summarized  below. 


virtual 

instruction 

execution 
t ime 
( usee ) 

weighted 
t ime 
( usee ) 

size 
( words ) 

pushvar iable 

26.2 

6.8 

10 

index 

48.2 

3.4 

22 

pushaddress 

23.6 

3.  1 

9 

pushconstant 

18.9 

1.7 

7 

equalword 

26.6 

1 . 3 

10 

copyword 

21.5 

1 . 1 

8 

copy byte 

24.6 

1 . 0 

9 

enter 

60.7 

.6 

23 

exit 

27.2 

.3 

10 

19.3 

108 

Average  instruction  time:  27.2  usee. 
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LSI  il  MICROCOMPUTER 


The  LSI  1 1 is  a microcoraputer  version  of  the  POP  11  minicomputer. 
It  uses  the  same  basic  instructions  as  the  POP  11/M5.  The  differences 
do  not  affect  the  code  pieces  of  the  interpreter.  Thus  only  the  speeds 
of  the  machines  differ.  The  execution  times  and  space  requirements  are 
summarized  below. 


virtual 

instruction 

execution 
t Ime 
( usee ) 

weighted 
time 
( usee ) 

size 

(words) 

pushvariable 

24.2 

6.3 

4 

index 

77.4 

5.4 

12 

pushaddress 

22.0 

2.9 

3 

pushconstant 

13.6 

1.2 

2 

equalword 

29.8 

1.5 

6 

copy word 

13.0 

.6 

2 

copybyte 

13.0 

.5 

2 

enter 

54.6 

.5 

10 

exit 

39.9 

. 4 

8 

19.3 

49 

Average  instruction  time:  27.2  usee. 


i 
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7.1  Machine  Properties 

Tne  NOVA  computer  family  is  a rarige  of  16  bit  processors,  including 
a single  chip  microprocessor.  All  'execute  the  same  basic  instruction  ) 

set.  Tne  macnine  has  4 registers,  2 of  which  may  be  used  as  index 
registers.  Arithmetic  can  only  be  done  between  register  pairs.  The 
stacK  operations  PUSH  and  POP  are  included  in  the  instruction  set,  but 
the  stack  grows  in  the  opposite  dir»ection  from  the  Concurrent  Pascal 
stack  and  is  therefore  is  not  used  for  bur  purposes.  The  addressing 
modes  of  interest  are  direct  and  based . In  the  direct  mode  the 
instruction  can  directly  access  a word  with  memory  address  0-255-  In 
based  mode  an  offset  in  the  range  -126  to  127  is  added  to  the  contents 
of  one  of  the  index  registers  to  determine  the  me.mory  address.  Indirect 
addressing  may  be  specified  in  addition  to  each  of  these  modes. 

Auto-increment  locations,  memory  words  16-23,  are  incremented 
automatically  after  serving  as  indirect  addresses.  All  addresses  are  15 
bit  word  addresses. 

7.2  Interpreter  Implementation 

All  displacements  and  increments  will  be  in  words.  A variable 
address  stored  in  the  stack  is  usually  an  absolute  word  address.  The 
only  exception  is  the  address  of  a character  within  a string.  This  is  a 
byte  address  which  will  be  interpreted  by  one  of  the  instructions 
Dushbvte  or  coDvbvte . 


22 
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The  Concurrent  Pascal  registers  will  be  allocated  as  follows; 

X,  y registers 

w,  z index  registers 

s,  b,  g low  memory  (below  255) 

q auto-increment  memory  (16-23) 

The  transfer  operation  next  uses  a modified  form  of  threaded  code. 
Each  code  piece  finishes  by  jumping  to  the  next  virtual  instruction, 
which  jumps  to  the  next  code  piece.  An  operation  table  in  low  memory 
contains  tne  absolute  address  of  all  code  pieces.  So  tne  next  operation 
is 

p: =q;  q ; + 1 ; 

and  the  virtual  instruction  is 
p : =word ( addr ) 

where  addr  is  a fixed  address  in  the  operation  table.  The  whole 
sequence  takes  about  9 usee. 

The  following  describes  the  code  pieces  for  the  nine  most  frequent 
virtual  instructions. 

pushconstant ( value ) : 

X : =word ( q ) ; q : + 1 ; 
s;-1 ; 

word ( s ) : =x ; 
next ; 
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pushaddress(displ): 

X : =word(q) ; q :+l ; 
y :=b; 
y :+x; 

3:-1 ; 

word( s) : =y  ; 
next ; 


pushvariable( displ ) : 

X : =word ( q ) ; q : + 1 ; 
w : =b ; 
w : +x ; 

X : =word( w) ; 

3 : -1  ; 

word( 3) ; =x; 
next ; 


copy word : 

X : =word( 3 ) ; 

3 : + 1 ; 

w : =word( 3 ) ; 
3 : + 1 ; 

word ( w ) : rx ; 
next ; 


The  c regi3ter  U3ed  in  the  following  code  piece  is  a one  bit  register. 

The  iX  statements  are  conditional  skip  instructions.  An  octal  constant 

#177400  is  a byte  mask  and  is  stored  in  low  memory. 

copyby te : 

X ; =word( s ) ; 
s : + 1 ; 

y : =word(s) ; 
s : + 1 ; 

w:=y  div  2;  c:=  y mod  2; 
y : =word( w) ; 
if  c = 1 then 

swap  bytes  in  y; 
z:=# 177400; 
y : and  z ; 
y:+x; 

if  c = 1 then 

swap  bytes  in  y; 
word( w) : =y ; 
next ; 


[ S I 

c i- . 


V' 
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equalword : 

w : =0 ; 

X : =word(s) ; 
s : + 1 ; 

y : =word ( s ) ; 
if  X = y then 
w : + 1 ; 

word ( s ) : =w ; 
next ; 


In  the  following  code  piece  the  length  argument  is  the  length  in  words. 
For  character  strings  the  length  will  be  zero.  A byte  address  is  formed 
by  concatenating  a low  order  bit  to  the  word  address. 


index ( min , max-m in , length ) : 
y : =word ( s ) ; 
s : + 1 ; 

X : =word ( q ) ; q : +1 ; 
y • “ X j 

iX  y < 0 then 

goto  rangeerror; 
w : =word ( q ) ; q : +1 ; 
if  y > w then 

goto  rangeerror; 
w : =word ( s ) ; 

X : =word ( q ) ; q : +1  ; 
if  X = 0 then 
w:»2; 

if  X > 1 then 


y : +w ; 

word ( s ) ; =y ; 
next ; 


ts.' 
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enter(notusecl,poplengtli,lineno,varlength): 
q:  + l ; 
s ; - 1 ; 
x;=g; 

word ( s ) : =x ; 
s:-1  ; 
x:=b; 

word ( s ) : =x ; 

• s:-1: 

X : =3 ; 

y :=word(q)  ; q:+1 ; 

X :+y ; 

word ( s ) : =x ; 
s:-1; 

x : =word(q)  ; q : +1 ; 
word(s) ; =x; 

x : =3  ; 

b : =x ; 

y : =word(q) ; q : +1 ; 
x:-y  ; 

3 ; = X ; 

next ; 


exit ; 

w : =b ; 

X : =word(w+ 1 ) ; 
3 : =x ; 

X : =word(w+2 ) ; 

b : =x ; 

x: =word(w+3) ; 
g:=x; 

x : =word(w+4 ) ; 

q:=x: 

next; 


7.3  Performance 


The  execution  timea  and  space  requirements  are  summarized  below. 
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virtual 

instruction 

execution 
t ime 
( usee ) 

we ighted 
time 
( usee ) 

size 

(words) 

pushvariable 

30. 

7.9 

7 

index* 

57.4 

4.0 

18 

pushaddress 

27.5 

3.6 

6 

pushconstant 

22.2 

2.0 

4 

equalword 

31.9 

1.6 

8 

copy word 

27.5 

1.4 

6 

copyby te 

50.  1 

2.0 

15 

enter 

79.  1 

.8 

21 

exit 

34.8 

. 3 

10 

23.6 

95 

•Time  given  is 

for  character 

string  access. 

Others  require  96.3  usee. 

Average  instruction  time:  33.2  usee. 


’h 


3 
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The  Concurrent  Pascal  machine  registers  will  be  allocated  as 
follows : 

X register  0 

y register  1 

w,  3 registers  2 and  3i  which  are  index  registers 

b,  g,  q low  memory 

The  transfer  operation  next  jumps  from  the  end  of  a code  piece  to 
the  next  virtual  instruction,  which  jumps  to  the  next  code  piece.  The  q 
register  is  incremented  as  the  first  operation  of  each  code  piece.  So 
the  next  operation  consists  of  the  steps: 
p:=q; 

p : =word(addr ) ; 
and  q : +1 ; 

Here  addr  is  a fixed  address  in  the  operation  table,  which  is  stored  in 
low  memory.  ' The  whole  sequence,  which  will  be  written  simply  next . 
takes  10.5  usee. 

The  following  describes  the  code  pieces  for  the  nine  most  frequent 
virtual  instructions. 

pushcon3tant(value) : 

3:-1 ; 

x:=word(q) ; 
q:  + 1 ; 

word(s) : =x ; 
next ; 


The  following  code  piece  uses  a one  bit  register  c,  which  can  be  tested 

by  a conditional  branch  instruction.  The  octal  constant  fM77600  is  used 

as  a mask. 

copybyte : 

y:=word(s); 
s : +1  ; 

w: =word(s) ; 
s : + 1 ; 

w:  div  2;  c:=  w mod  2; 

X : =word(w) ; 

if  c = 1 then  goto  A; 

x:  and  #177600; 

y :+x; 

goto  B; 

A:  swap  bytes  in  x; 

x:  and  #177600 
y :+x; 

swap  bytes  in  y; 

B:  word(w) : =y  ; 

next ; 


MICROCOMPUTER  EVALUATION:  INSTRUCTION  SETS 
PACE  MICROCOMPUTER 


31 


Tne  following  code  piece  uses  exclusive  or  (xor)  to  test  for  equality 

since  subtraction  is  not  available. 

equalword ; 

w:  = l ; 

y : =word( s) ; 
s : + 1 ; 

X : =word(s)  ; 

X : xor  y ; 
if  X <>  0 then 
w ; =0 ; 

word( s ) ; =w ; 
next ; 


In  the  following  code  piece  a comment  notes  the  lack  of  a multiplication 

instruction.  Multiplication  must  be  done  by  software. 

indexCmin ,max-min .length) ; 
y : :word(s) ; 
s : + 1 ; 

X : =word( q ) ; 
q : + l ; 

X : =-x ; 

X :+y  ; 

if  X < 0 then  goto  err; 
y :=x; 

X ; =word( q) ; 
q : + 1 ; 

X : =-x ; 

X :+y ; 

if  X > 0 then  goto  err; 
w : =word ( s ) ; 

X : =word(q) ; 
q : + 1 ; 

if  X = 0 then  goto  A; 

" multiply  y:*x  " 
goto  B ; 

A:  w:»2; 

B : w : +y  ; 

word ( 3 ) : =w ; 
next ; 

err:  goto  rangeerror; 
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enter(notusecl,poplength,llneno,varlength): 
q;  + 1 ; 
s:-1 ; 
x:=g; 

word ( s ) : =x ; 
s:-1  ; 

X : =b ; 

word ( 3 ) : =x ; 
s:-1  ; 

X : =word ( q) ; 
q:+1  ; 

X : +3 ; 

word(s ) : =x; 

3;-1; 

X : =word( q ) ; 
q:  + 1 ; 

word(3); =x; 

b : =3 ; 

w: =word(q) ; 
q:  + 1 ; 

X : =-x ; 

3 : +x ; 

next ; 


exit : 

w : =b ; 

3 : =word(w+l ) ; 
X : =word ( w+2 ) ; 
b : =x ; 

X : =word(w+3) ; 
g ••  =x ; 

x: =word(w+4) ; 

q:=x; 

next ; 


8 . 3 Performance 


The  execution  times  and  space  requirements  are  summarized  below 


k 
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virtual 

instruction 

execution 
t ime 
(usee) 

weighted 
time 
( usee ) 

size 
( words ) 

pushvar iable 

33.6 

8.7 

9 

index* 

80.5 

5.6 

24 

pushaddress 

28.0 

3.6 

7 

pushconstant 

25.2 

2.3 

6 

equalword 

3'1.3 

1.7 

10 

copy word 

25.9 

1.3 

7 

copyby te 

51.1 

2.0 

17 

enter 

81.9 

.8 

23 

exit 

32.9 

.3 

10 

26.3  113 


* Time  and  space  for  multiplication  not  included. 
Average  instruction  time:  37.0  usee. 
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9.  QA  kSI  4/10  MICROCOMPUTER 

9.1  Machine  Properties 

The  Computer  Automation  LSI  4/10  computer  is  a single  board  LSI 
microprocessor.  It  has  2 accumulators  and  2 index  registers.  There  are 
four  relevant  addressing  modes.  Short  absolute  addressing  accesses  a 
word  in  memory  locations  0 to  63.  To  this  may  be  added  the  contents  of 
one  or  both  index  registers.  Long  absolute  addressing  allows  any 
address  to  be  selected,  modified  by  one  or  both  index  registers.  Long 
addressing  mode  requires  two  word  instructions.  Some  instructions  are 
available  in  long  or  short  form  only.  Short  indirect  addressing  finds 
the  memory  address  in  low  memory  (locations  0 to  63).  Relative 
addressing  is  used  for  program  counter  relative  jump  instructions. 
Addresses  are  either  (1)  fifteen  bit  word  addresses;  (2)  sixteen  bit 
word  addresses;  or  (3)  sixteen  bit  byte  addresses.  Two  bits  in  the 
machine  state  determine  the  addressing  currently  in  effect.  Byte 
addressing  within  a word  is  opposite  of  the  PDP  11.  This  machine  has  a 
stack,  but  there  are  no  instructions  for  pushing  or  popping  individual 
items,  so  it  is  unsuitable  for  our  purposes  and  is  not  used. 

9.2  Interpreter  Implementation 


All  displacements  and  increments  will  be  words.  The  machine  state 
will  normally  specify  word  addressing.  A variable  stored  in  the  stack 
is  normally  a word  address  except  when  it  refers  to  a character  in  a 
string,  in  which  case  it  is  a byte  address.  The  pushbvte  and  coDvbvte 
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instructions  temporarily  change  the  state  to  byte  addressing. 


Concurrent  Pascal  registers  are  allocated  as  follows: 

X,  y accumulators 
s,  w index  registers 
b,  g,  q low  memory 

The  transfer  operator  next  uses  the  technique  of  threaded  code.  The 
operation  is 

w: =word(word(q) ) ; 

q;  + l ; 

p:=w; 

This  sequence  takes  19.7  usee. 


The  following  describes  the  code  pieces  for  the  nine  most  frequent 
virtual  instructions. 


pushconstant( value) ; 

x : =word(q) ; 
q:+1  ; 
s : - 1 ; 

word ( s ) : =x ; 
next ; 


pushaddres3(displ ) : 

X : =word(q) ; 
q : + 1 ; 

X : +b ; 
s:-1 ; 

word( s ) ; =x ; 
next ; 


r 
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pushvariable(displ) : 
i>v'i  w:=word(q); 

V q:+i; 

w : +b ; 

X : =word ( w) ; 
s;-1; 

word(s); =x; 
next; 


copyword : 

X : =word( 3 ) ; 

3 : + 1 ; 

w ; =word ( 3 ) ; 

3 : + 1 ; 

word ( w ) : =x ; 
next ; 


The  exoIu3ive  or  instruction  in  the  following  code  piece  is  used  to 

change  a LSI  4/10  byte  index  into  a PDP  11  byte  index. 

copybyte : 

X : =word ( s ) ; 

3 ; +1  ; 

w : =word ( s ) ; 
s:  + l ; 
w : xor  1 ; 

change  to  byte  mode; 
byte(w):=right(x); 
change  to  word  mode; 
next ; 


The  iil  statement  in  the  following  code  piece  is  a conditional  jump 

instruction . 

equalword  : 

y:.0; 

X : =word ( s ) ; 

3 : + 1 ; 

X : -word ( s ) ; 
if  X = 0 then 
y ; + 1 ; 

word( 3 ) : =y ; 
next ; 


4. 
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index(inin, max-rain,  length): 
y : =word { s ) ; 

3 : + 1 ; 

y : -word(q  ) ; 

q : -t-l  ; 

if  y < 0 then  .toto  err ; 
X : =word( q ) ; 
q : + 1 ; 
x:-y : 

iX  X > 0 then  goto  err; 
X : rword( s ) ; 
w : =word( q ) ; 

iX  w <>  0 then  goto  A; 
x:*2; 
y :+x; 
goto  B ; 

A:  y : •wordC q ) +x ; 

B : q : +1  ; 

word( s) : =y ; 
next ; 

err:  goto  rangeerror; 


enter(notused,poplength,lineno,varlength): 
q : + 1 ; 
s:-1  ; 
x:=g; 

word ( s ) : =x ; 

3 :-1  ; 
x:=b; 

word ( 3 ) : =x ; 

3 : -1  ; 

X : =word(q) ; 
q:  + l ; 

X : +3 ; 

word(s) : =x; 
s:-1; 

X : =word(q) ; 
q:  + 1 ; 

word ( 3 ) : =x ; 

b : =3 ; 

3 : -word ( q ) ; 
q:  + 1 ; 
next ; 
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exit : 

w : =b ; 

s : =word(w+l ) ; 
X : =worcj(w+2 ) ; 
b : =x ; 

X : iword ( w+3 ) ; 

g::x; 

X : =word (w+4  ) ; 

q;=x; 

next ; 


9 . 3 Performance 


The  execution  times  and  space  requirements  are  summarized  below. 


virtual 

instruction 

execution 
t ime 
( usee ) 

weighted 
t ime 
( usee ) 

size 

(words) 

pushvariable 

46.1 

12.0 

9 

index 

106.9 

7.5 

22 

pushaddress 

42.4 

5.5 

8 

pushconstant 

38.6 

3.5 

7 

equalword 

46.0 

2.3 

10 

copy word 

39.6 

2.0 

8 

copybyte 

57.8 

2.3 

1 1 

enter 

116.2 

1.2 

22 

exit 

51.9 

.5 

1 1 

36.8 

108 

Average  instruction  time:  51.8  usee. 
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10.  11  9900  MICROCOMPUTER 

10. 1 Machine  Properties 

Tne  Texas  Instruments  9900  microcomputer  is  a single  chip 
microprocessor  compatible  with  a family  of  minicomputers.  The  machine 
has  16  general  purpose  registers,  which  are  implemented  in  main  memory. 
A workspace  pointer  contains  the  address  of  the  register  block.  The 
register  mode  specifies  that  the  operand  is  in  a register.  The  register 
indirect  mode  specifies  that  the  address  is  in  a register.  Register 
indirect  auto-increment  mode  specifies  that  the  address  in  in  a 
register,  and  that  the  register  is  to  be  incremented  after  it  is  used. 
Some  instructions  use  other  modes,  such  as  program  counter  relative 
jumps.  Addresses  are  byte  addresses,  and  each  instruction  specifies 
whether  it  operates  on  byte  or  word  data. 


10.2  Interpreter  Implementation 

The  Concurrent  Pascal  registers  are  all  allocated  in  general 
registers.  Addressing  is  identical  to  that  on  the  POP  11.  The  transfer 
operator  next  implements  threaded  code,  as  on  the  POP  11.  The  operation 
table  may  be  located  anywhere  in  memory,  and  the  virtual  instructions 
contain  absolute  address  of  entries  in  this  table.  The  next  operation 
is : 

x:=word(q);  q:+2; 
p : =word( X ) ; 

This  takes  approximately  17  usee. 
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The  following  describes  the  code  pieces  for  the  nine  most  frequent 
virtual  instructions. 


pushconstant{value) : 
s:-2; 

word(s) : =word(q) ; q:+2; 
next ; 


pushaddres3( displ ) : 

x:=word(q);  q:+2; 
s : -2 ; 

X :+b; 

word(s ) : =x; 
next ; 


pushvariable( displ ) : 

x:=word(q);  q:+2; 

X : +b ; 

s:-2; 

word( s ) : =word( x ) ; 
next ; 


copyword : 

X : =word ( s) ; s : +2 ; 
y : =word( s ) ; s : +2 ; 
word(y) : =x; 
next ; 


copybyte : 

x:=word(s);  s:+2; 
y : =word(s) ; s : +2 ; 
byte{y):=right(x); 
next ; 


The  if  statement  in  the  following  code  piece  is  a conditional  skip 
instruction . 
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equalword : 

w : =0 ; 

word(s)  compare  word(s+2);  s:+2; 
if  X = y ijiaa 
w : + 1 ; 

word ( s ) : =w ; 
next ; 


index(min ,max-min , length ) : 

x:=word(s);  s:+2; 
x:-word(q)  ; q:+2; 
if  X < 0 then  goto  err; 

X compare  word(q);  q : +2 ; 
if  greater  then  goto  err 
X :»word(q) ; q:+2; 
word ( s ) : +x ; 
next ; 
err;  goto 

rangeerror ; 


enter( not used, poplength, line no, varlength) 
q;+2; 
s;-2; 

word(s) ; =g; 

3;-2; 

word( s) ; =b; 

3;-2; 

word (s ) ; =s  ; 

word( s) ; +word(q) ; q;+2; 
s;-2; 

word( s) ; =word(q) ; q;+2; 

b : =3 ; 

3 ; -word ( q ) ; q : +2 ; 
next ; 


exit : 

3 ; =b ; 

3 ; + 2 ; 

w : =word ( s ) ; s ; +2 ; 
b:=word(s);  s;+2; 
g;=word(3);  3;+2; 
q : =word ( s ) ; s ; +2 ; 
3;=w; 
next ; 
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10.3  Performance 

The  execution  times  and  space  requirements  are  summarized  below. 


virtual 

instruction 

execution 

time 

(usee) 

weighted 

time 

(usee) 

size 

(words) 

pushvariable 

52.7 

13.7 

6 

index 

92.7 

6.5 

1 1 

pushaddress 

50.7 

6.6 

6 

pushconstant 

36.0 

3.2 

4 

equalword 

53.3 

2.7 

7 

copyword 

49.3 

2.5 

5 

copybyte 

49.3 

2.0 

5 

enter 

117.3 

1.2 

14 

exit 

82.7 

.8 

9 

39.2 

67 

Average  instruction  time:  55.2  usee. 
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n.  PERFORMANCE  COMPARISON 


Machine 

Speed 

(usee) 

Relative 

Size 

PDP  11/45 

10 

100 

GA  16/110 

27 

220 

LSI  1 1 

27 

100 

NOVA 

33 

194 

PACE  (no  mult) 

37 

231 

CA  LSI  4/10 

52 

220 

TI  9900 

55 

137 
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This  is  one  of  several  reports  which  compare  16  bit  microcomputers  with 
respect  to  implementation  of  a Concurrent  Pascal  subset.  This  report 
evaluates  the  process  switching  time  of  various  microprocessors,  that 
is,  the  time  required  to  preempt  one  process  and  resume  another.  This 
influences  the  response  time  of  a microcomputer  to  real-time  events 
(such  as  interrupts). 


1 . INTRODUCTION 


This  is  one  of  several  reports  which  compare  16  bit  microcomputers 
with  respect  to  implementation  of  a Concurrent  Pascal  subset  [1,2]. 
This  report  evaluates  the  process  switching  time  of  various 
microprocessors,  that  is  the  time  required  to  preempt  one  process  and 
resume  another  one.  The  process  switching  time  and  the  scheduling 
policy  of  processes  determine  the  response  time  of  a microcomputer  to 
real-time  events  (such  as  interrupts). 

2.  PROCESS  SCHEDULING 

In  the  implementation  of  the  Concurrent  Pascal  subset,  the 
scheduling  of  processes  is  simplified  as  much  as  possible: 

The  processor  executes  one  process  at  a time.  In  general  a running 
process  continues  its  execution  until  it  either  terminates  or  waits  for 
some  condition  to  be  satisfied.  A process  can  wait  until  a monitor  gate 
is  open,  or  until  a continue  operation  has  been  performed  on  a monitor 
queue  or  until  an  input/output  operation  has  been  completed. 

When  a process  is  not  running  it  waits  in  a ready  queue  which  is 
served  in  first-come . first-served  order. 

A process  waiting  for  one  of  the  conditions  mentioned  earlier  will 
be  resumed  periodically  (just  as  any  other  process).  If  the  process 
finds  that  its  condition  is  satisfied  then  it  will  continue  execution; 
otherwise  it  reenters  the  ready  queue  to  wait  for  another  turn. 
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After  initialization  the  only  operations  on  the  ready  queue  are 
p^reemot  and  resume . 


Preempt  saves  the  register  va  .ues  of  the  running  process  on  top  of 
its  stack  and  enters  the  stack  address  in  the  ready  queue.  If  the  ready 
queue  is  full  the  concurrent  program  terminates  with  an  error  message. 

Resume  restores  the  stack  address  of  a process  from  the  ready 
queue,  restores  the’  register  values  from  the  stack,  and  continues  the 
execution  of  the  process.  If  the  ready  queue  is  empty  the  concurrent 
program  terminates. 


Since  process  switching  takes  place  only  after  the  completion  of  a 
virtual  instruction  it  is  necessary  only  to  save  and  restore  the  values 
of  the  registers  q,  g,  b,  and  s. 


In  Pascal  the  ready  queue  can  be  represented  by  an  array  of  words 
and  three  variables  defining  the  indices  of  the  head  and  tall  elements 
as  well  as  the  length  of  the  queue: 


const  proclim  = capacity  of  ready  queue; 
var 

list:  arravT 1. .orocliml  of  word; 
head,  tail:  1.. proclim; 
length:  0.. proclim; 

procedure  preempt; 
begin 

if  length  = proclim  then  termlnate(proclimit) ; 

s:=  pred(s);  word[s]:=q; 

s:=  pred(s);  word[s]:=g; 

a:=  pred(s);  word[s]:=b; 

list[tail]:=  s; 

tail:=  tail  mod  proclim  + 1; 

length:=  succ(length) 
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procedure  resume; 
begin 

if  length  = 0 then  terminate( terminated ) ; 

s:=  list[head]; 

head : = head  mod  proclim  + 1; 

b:=  word[s];  s:=  succ(s); 

g:=  word[s];  s:=  succ(s); 

q:=  word[s];  s:=  succ(s); 

length:=  pred(length) 

end; 


3.  IMPLEMENTATION 


On  three  of  the  four  machines  evaluated  the  implementation  is  quite 
close  to  the  Pascal  procedures.  The  registers  are  stored  on  the  stack 
just  as  in  a procedure  call.  The  TI  9900  processor,  on  the  other  hand, 
has  a fast  processor  switching  mechanism.  On  this  machine  the  general 
registers  are  stored  in  main  memory.  The  workspace  pointer  is  a 
register  that  defines  the  base  address  of  the  current  register  block. 
On  this  machine  the  variables  head,  tail,  and  length,  as  well  as  needed 
constants  are  stored  in  a separate  register  block  that  is  used  only 
during  process  switching.  The  previous  workspace  pointer  is  stored  in 
the  ready  queue. 


3.1.  QA  16/110  Implementation 


The  variables  length,  head,  and  tail  are  allocated  in  low  memory  so 
they  may  be  accessed  in  absolute  mode.  Several  two-word  immediate 
instructions  are  used. 
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preempt : 

x:=  length; 

X compare  proclim; 

if  equal  then  goto  proclimit; 

s:  -1  ; 

wordCs) : =q ; 
x;=g; 
s:-1  ; 

word ( s ) : =x ; 

X : = b ; 

3;-1  ; 

word(s) : =x; 
w : =tail ; 
word ( w) : =s ; 
w : +1  ; 

w compare  lasttail; 
if  equal  then 
w:-proclim; 
tail : =w ; 
length:+1 ; 
return ; 

resume : 

X : rlength ; 

X compare  0 ; 

if  equal  then  goto  terminated; 
w : =head ; 
s : =word ( w ) ; 
w : + 1 ; 

w compare  lasthead; 
if  equal  then 
w : -proclim ; 
head : =w ; 
length: -1 ; 

X : =word ( s ) ; 

b ; = X ; 

s : + 1 ; 

X : rword ( s ) ; 
g:=x; 

3 : +1  ; 

q : zword ( s ) ; 

3 : + 1 ; 

return ; 
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3.2.  LSI  11  Implementation 

The  variables  length,  head,  and  tail  are  stored  in  memory 
constants  proclira  and  lasttail  are  Immediate  operands. 


preempt : 

length  compare  proclim; 
if  equal  then  goto  proclimit; 
s ; -2  ; word ( s ) : =q ; 
s;-2;  word(s):=g; 
s:-2;  word(s):=b; 
y : =tail ; 

word(y):=s;  y:+2; 
y compare  lasttail; 
if  equal  then 
y ; -2*proclim ; 
tail : =y ; 
length : + 1 ; 
return ; 

resume : 

test  length; 

if  zero  then  goto  terminated; 

X ; =head ; 

s:-word(x);  x: +2; 

X compare  lasthead; 
if  equal  then 
x:-2*proclim; 
head : =x ; 

b : =word ( s ) ; s : +2 ; 
g ; =word ( s ) ; s : +2  ; 
q:=word(s);  s:+2; 
length : -1  ; 
return ; 


3. 3-  NOVA  Implementation 

The  variables  length,  head,  and  tail,  as  well  as  constants 
and  lasttail  are  stored  in  low  memory. 


The 


proclim 
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preempt : 

X : ^length ; 
y : rproclim; 
iX  X = y then 
goto  proclirait; 
s : - 1 : 

word ( 3 ) : =q  ; 
s:-l  ; 

word(s) : =g ; 

3:-1  ; 

word( 3 ) : =b ; 

X : =3 ; 

w : =tail ; 
word ( w) : =x ; 
w : + 1 ; 

X : =la3ttail ; 
iX  X = w then 
w : -y  ; 
tail : =w ; 
length : + 1 ; 
return ; 

reaume : 

X : zlength ; 
if  X = 0 then 

£Qto  terminated; 
w : rhead ; 

X : rword ( w ) ; 

3:  =x; 
w ; +1  ; 

y : =proclira; 

X : zlasthead ; 
iX  X = w then 
w:-y ; 
head : =w ; 

X : =word ( s ) ; 

3 : + 1 ; 

b:=x; 

x: rword(s) ; 

3 : + 1 ; 

g:=x; 

X : =word( 3 ) ; 
s:  + 1 ; 
q:=x; 
return ; 
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3.4.  TI  9900  Implementation 


The  preempt  and  resume  programs  for  the  TI  9900  are  somewhat 
different.  The  work  space  pointer  WP  is  saved  and  restored  from  the  the 
ready  queue  and  the  other  registers  are  saved  and  restored  as  a side 
effect  of  this.  Thus  the  corresponding  Pascal  procedures  are: 


procedure  preempt; 
begin 

if  length  = proclim  then  terminateC proclimit ) ; 
llst[tail] : =WP ; 
tail:=tail  mod  proclim  + 1; 
length:=  succ(length) 

procedure resume ; 
begin 

if  length  = 0 then  terminate( terminated ) ; 

WP : rlist[head ] ; 

head:=  head  mod  proclim  + 1; 

length:=  pred(length) 

ead; 


The  variables  length,  head,  and  tail,  as  well  as  the  constant 
proclim,  are  stored  in  registers.  The  WP  register  contains  the  previous 
value  of  the  workspace  pointer  (before  the  call),  and  the  workspace 
pointer  is  restored  from  the  same  register  by  a return  instruction. 
Using  this  scheme  each  process  requires  a dedicated  register  block  of  16 
words.  This  could  be  allocated  out  of  its  stack  space  at  initialization 
time . 
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preempt : 

length  compare  proclim; 
i£  equal  then  goto  proclimit; 
wordCtail) : =WP;  cail:+2; 
tail  compare  lasttail; 
if  equal  then 
tail:-proclim; 
length:+2; 
return ; 

resume : 

test  length; 

if  zero  then  goto  terminated; 
WP:=  word (head);  head: +2; 
head  compare  lasthead; 
if  equal  then 
head:-proclim; 
length : -2 ; 
return ; 


1. 


1 
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4.  PERFORMANCE 


Machine 

Speed 

(usee) 

Space 
( words ) 

TI  9900 

80 

16 

GA  16/110 

1 1 1 

44- 

4 

NOVA 

120 

42 

LSI  1 1 

143 

39 

Process  switching  time  for  all  machines  takes  about  as  much  time  as 
2 to  5 virtual  instructions.  The  TI  9900  clearly  benefits  from  the  fast 
switching  mechanism  in  its  instruction  set.  Unfortunately  the 
flexibility  that  allowed  this  was  partly  responsible  for  its  poor 
performance  in  interpreting  Concurrent  Pascal  code.  The  other  machines 
are  similar  to  each  other  in  performance  and  in  the  nature  of  the  length 
of  the  code. 
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